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Abstract 

We consider the problem of efficiently enumerating the satisfying assignments to AC cir- 
cuits. We give a zero-error randomized algorithm which takes an AC circuit as input and 
constructs a set of restrictions which partition {0, 1}™ so that under each restriction the value 
jyj ' of the circuit is constant. Let d denote the depth of the circuit and en denote the number 

of gates. This algorithm runs in time \C\2 ntyl ~^ cd ^ where \C\ is the size of the circuit for 
Mc,d > I/O (1§ C + d\gd) _ with probability at least 1 — 2 _n . 

As a result, we get improved exponential time algorithms for AC circuit satisfiability and 
for counting solutions. In addition, we get an improved bound on the correlation of AC circuits 
with parity. 

As an important component of our analysis, we extend the Hastad Switching Lemma to 



(N 



*yv , handle multiple k-CNFs and fc-DNFs. 
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1 Introduction 

The Circuit Satisfiability problem, deciding whether a Boolean circuit has an assignment where it 
evaluates to true, is in many ways the canonical NP-complete problem. It is not only important 
from a theoretical point of view, but pragmatically, since search problems directly reduce to Circuit 
Satisfiability without increasing the size of the search space. In contrast, reducing problems such 
as planning or model-checking to the special cases of CNF Satisfiability or 3-SAT can increase the 
number of variables dramatically. On the other hand, cnf-sat algorithms (especially for small 
width clauses such as fc-CNFs) have both been analyzed theoretically and implemented empirically 
with astounding success, even when this overhead is taken into account. For some domains of 
structured CNF formulas that arise in many applications, state-of-the-art SAT-solvers can handle 
up to hundreds of thousands of variables and millions of clauses [TTj . Meanwhile, very few results 
are known either theoretically or empirically about the difficulty of Circuit Satisfiability. 

This raises the question: for which classes of circuits are there non-trivial Satisfiability algo- 
rithms? More precisely, for an algorithm that decides satisfiability for circuits in a class C, we 
express its worst-case running time as \C\2 n ^~^' where C is a circuit with n inputs. We say that 
/i is the savings over exhaustive search and write it in terms of n and the parameters of the class 
C, for example, the ratio of gates or wires to the number of inputs, and depth. The larger a is, the 
more non-trivial the algorithm can be considered. When can we exploit the structural properties of 
C to obtain savings over exhaustive search? What are the best savings for various circuit classes? 
How is the expressive power of a circuit class related to the amount of savings for its satisfiability 
problem? 

Williams' recent work |20|. I2T] has given an additional motivation to look at improved algo- 
rithms for Circuit Satisfiability problems. He shows that Circuit Satisfiablity algorithms for a class 
of circuits with even a very small improvement (u = w(log n/n) ) over exhaustive search imply 
circuit lower bounds for that class. Thus, he formally proves a connection showing that the same 
understanding of the limitations of a circuit class is needed both for improved algorithms and to 
prove lower bounds. 

Among the satisfiability problems for various circuit classes, the k-SAT problem has attracted 
the attention of several researchers since Monien and Speckenmeyer [12] first showed that it can be 
computed in less than 2 n time in the worst-case, that is, in time \F\2 n ^ 1 ~^ k > for \i% > where F 
is a A;-cnf. Researchers have obtained a series of improvements for u^, with particular attention 
to the case k = 3. Currently, the best known savings for 3-SAT is about 0.614 for randomized 
algorithms [144 19] and about 0.4157 for deterministic algorithms [181 H3j. There are a suprisingly 
wide variety of state-of-the-art algorithms for &-SAT, but they all achieve savings /U& of the form 

e(i/k) [Ha EH]. 

Much less is known about the nature of savings for satisfiability problems for more expressive 
circuit classes. Schuler [19] has shown that the satisfiability of an m-clause CNF F can be deter- 
mined in time |F|2 n ^ 1+igW. Using a bit more careful analysis, Calabro et al. [3 J improved this 
from savings I/O (lgm) to I/O (lg ^), giving constant savings when m = O (n). 

Subsequently, Calabro et al. [4J have considered the question of satisfiability of bounded- 
depth unbounded fan-in circuits over standard basis (AC circuits) and shown that the sat- 
isfiability of AC circuits C of size en and depth d can be decided in time |C|2 n ( 1_/ ^ where 
fi > l/0(c 2 _1 lg _ c). Santhanam [17] has considered cn-size formulas F with no depth 

restriction and showed that the satisfiability problem for such formulas can be solved in time 

|_p|2 n V ~ poiy(c)) _ More recently, Williams [21] has shown that the satisfiability of ACC circuits C 

can be solved in time \C\2 n ut - n > . 



In this paper, we return to AC circuits and seek further improvement in the savings for the 
satisfiability algorithm for AC circuits of size en and depth d. While the algorithm in [3] obtains 
savings in terms of c and d independent of n, its double exponentially small savings diminish rapidly 
to zero for d > 2 when c grows as a function of n. While Williams' algorithm [21] provides nontrivial 
savings even when c grows sufficiently large as a function of n, its savings decreases with n even 
when c and d are constants. Furthermore, it leaves open the question whether one can obtain better 
savings for AC circuits. 

Another independent motivation is the natural connection to proving lower bounds. Paturi et 

al. [15] have observed that the analysis that led to an improved upper bound for k-SAT can also 

be used to prove lower bounds for depth-3 AC circuits. Using this approach, they obtain a tight 

lower bound of Q(ni2^) on the number of gates required to compute the parity function with 

depth-3 circuits. Subsequently, [Hj proposed a resolution-based fc-SAT algorithm and obtained an 

improved savings (by a constant factor) using a sophisticated analysis. Using the same analysis, 

they construct a fairly simple function (checking whether the input binary string is a codeword 

of a certain code) which requires at least 2 L282 v^ size for any depth-3 AC circuit. This is the 

best-known lower bound for depth-3 circuits for any function. Although depth reduction techniques 

based on Switching Lemma [SJISMI] and especially the top-down technique of [7] for depth-3 circuits 

prove lower bounds that are close to 2^, it is not clear that these techniques by themselves would 

yield a lower bound of 2 c v n for c > 1. 

i 

It has been a long open problem to prove 2^ n ~ ' size lower bounds for AC circuits of depth 
d. Since the breakthrough results of Yao and Hastad in the mid 1980's, there have been only 
modest improvements |22} [5] and only for for depth-3 circuits [14]. It is tantalizing to prove 
better lower bounds by exploiting the connection between AC satisfiability upper bounds and 
lower bounds. Unfortunately, we do not have any ideas for proving strong enough upper bounds 
for AC satisfiability that would imply better lower bounds. Our modest goal is to obtain a 
satisfiability algorithm for AC with savings sufficient enough to imply the some of the best-known 
lower bounds. 

1.1 Our Results and Techniques 

Our main result is an algorithm for AC satisfiability with the best known savings. 

Theorem 1.1. There is a Las Vegas algorithm for deciding the satisfiability of circuits with en 
gates and depth d whose expected time has savings at least ' 
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The above algorithm immediately follows from the existence of an algorithm that enumerates 
all satisfying assignments by partitioning the space into sub-cubes where the circuit is constant. 

Theorem 1.2. There exists a Las Vegas algorithm which, on input a en gate, depth d circuit C 
in n variables, produces a set of restrictions {pi}i which partition {0, l} n and such that for each i, 
C\ Pi is constant. The expected time and number of restrictions are both poly(n)\C\2 n ^ 1 ~^ c - d ' , where 



the savings u c4 is at least 0(lgc+dlgd) d-i • 

This result implies the known, almost tight, lower bounds for depth-size tradeoffs for computing 
parity. This is not surprising, because we use a version of the Hastad Switching Lemma that was 
used to prove these lower bounds. However, our results also imply new, very tight, bounds on how 
well constant-depth circuits can approximate the parity. As another corollary to Theorem 11.21 we 
obtain the following bounds on correlation between AC circuits and the parity function. Recently 
and independently from this work, Hastad [6] achieved a similar bound on correlation (although 
our result is better for small values of m). 



Theorem 1.3. The correlation of parity with any AC circuit of size en and depth d is at most 

2-Mc.d" _ 2-™/0(i°g c + dlo g d ) d_1 i 

Some particularly interesting special cases are: 

1. For linear sized families of circuits, with c,d constant, we obtain a constant savings u and a 
strongly exponential bound 2~®( n ' on the correlation with parity. 

([4] also gives constant savings for Satisfiability, but our constant improves on theirs more 
than exponentially and also holds for counting.) 

2. We obtain non-trivial savings and correlation bounds for m up to 2 ( n '. 

Independently, Hastad [6] achieved a similar result for correlation. Beame, Impagliazzo, and 
Srinivasan [2] also give an improved algorithm and correlation bound, but only for m up to 
a quasi-polynomial size. 

3. For /c-CNF's, we extend the known savings u = Q(l/k) for Satisfiability to also include 
counting and enumeration of solutions. 

The previous best algorithms for counting [10] had savings u ~ 2~ fc . 

Our results seem tight in the following two ways. For covering the set of solutions by sub-cubes, 
we achieve the best possible u up to a constant factor. Also, any algorithm that improved our a 
more than polynomially would prove that NEXP % NC using Williams' technique. 

A key ingredient in our analysis is an extended Switching Lemma, proved in section [ 



Lemma 1.4 (Extended Switching Lemma). Let <f>\, . . . , 4> m be a sequence ofk-CNFs and/or /c-DNFs 
in the same n variables. For any p < 1/13, let p be a random restriction which leaves pn variables 
unset. The probability that the decision tree for (</>i, . . . , 4> m )\ p has a path of length > s where each 
(pi contributes at least one node to the path is at most (13pk) s . 

Using the observation that any path in the decision tree for (</>i, . . . , 4> m )\ p is also a path in the 
decision tree for the subset of fc-CNFs or &;-dnfs which contribute nodes to the path, and using a 
union bound, we get the following corollary. 

Corollary 1.5. Let cj>\, . . . ,<j) m be a sequence of /c-CNFs and/or fc-DNFs in the same n variables. 
For any p < 1/13, let p be a random restriction which leaves pn variables unset. The probability 
that the decision tree for (</>i, . . . ,(f> m )\ p has a path of length > s is at most (2 m — l)(13pk) s . 

In the rest of the paper, we provide detailed algorithms and analysis to support our results. 

2 Notation 

We consider layered boolean circuits with alternating layers of AND and OR gates, and where 
negations are all at the input level. We refer to a circuit with d layers of gates as a depth d circuit. 
We number the layers from the output gate , layer 1, to the inputs and their negations (layer d+1). 
For circuits on n inputs, we parameterize them by m = en, the maximum number of gates in a 
layer and d, the number of layers. We call such circuits (n,m,d) -circuits, where m and d could be 
functions of n. For technical reasons we are also interested in a slight variant of (n,m, d)-circuits 
where we only require that each gate at level d has fan-in bounded by k (rather than limiting the 
number of gates at level d), for some k. All other layers are still required to have at most m gates. 
We refer to these circuits as (n,m,d,k)- circuits. 



A restriction p on a set of variables V is a map p : V — > {0, 1, *}. We say that the variables v 
where p{v) = * are unset. For Boolean function / on all n inputs, f\ p is the restricted function on 
the unset variables. For a circuit C, C\ p is the restricted circuit on the unset variables. 

We say that a set of functions 0i,...,0 m : {0, l} n — > {0,1} partitions {0, l} n if for every 
x E {0,1}™ there exists exactly one i such that 4>i(x) = 1. The i'th region of the partition is 
the set of x so that cf>i(x) = 1, and we identify the region with the function c/)j. In particular, we 
are interested in partitions defined by functions of the form R A p, where R is a /c-cnf and p is 
a restriction, which we will abbreviate as 1Z = (R,p)- We say that two circuits, C and D, are 
equivalent in a region IZ'iflZ =^- (C = D). 

Definition 2.1. Let C be a circuit in n variables. We say that a set V = {(TZi = (Ri, Pi),Ci)}i is 
a partitioning for C if {7£j}j defines a set of regions which partition {0, 1}™ and for every i, C is 
equivalent to C, in the region IZi. We say that C, is the circuit associated with the region IZi. 

We generalize this definition to sequences of circuits in the natural way. 

Define the height of a decision tree T, height(T), as the length of the longest path. We will 
view paths in decision trees as restrictions in the natural way. The canoncial decision tree for a 
CNF <fi, tree{4>) is constructed as follows: If any clause is empty, return 0. If there are no clauses, 
return 1. Otherwise, let C be the first clause. Query all the variables in C in order. Restrict (j) by 
the results of these queries and recurse. The canonical decision tree for a dnf is analogous. 

We also similarly define the canonical decision tree for a sequence of cnfs and/or dnfs $ = 
(01, . . . , 4>e). First construct the canonical decision tree for (j}\. Along each path, restrict cp2, ■ ■ ■ , 4>£ 
by the answers to queries and recurse. Label the leaves of the resulting tree with the tuples of 
leaves from the original trees. 

When we construct the canonical decision tree for a CNF eft, we group variables by the clause 
we are at when we query them, and say the clause contributes the corresponding set to the path. 
Similarly for any subcircuit ip of 4>, we regard the set of nodes contributed by all the occurrences 
of variables in ip as the set of nodes contributed by ip. 

3 Main Algorithm 

Schuler [19] gives an algorithm for cnf-sat with savings 0(1/ log c). He does this by reducing the 
CNF to a moderately exponential sized set of 0(logc)-CNFs through a case analysis, then using 
known /c-SAT algorithms such as [15J to get overall savings l/0(logc). Like 0], our algorithm can 
be thought of as generalizing Schuler's approach to larger depths, where each step performs a case 
analysis to reduce a single circuit of depth d to a moderate-sized collection of circuits which are all 
depth d—1. Calabro et. al. use a complex, but local, form of Schuler's case analysis, branching on 
large constant sized sub-formulas of the input formula. Here, we use a more global case analysis, 
based on Hastad's Switching Lemma. This global treatment was in part inspired by an algorithm 
of Santhanam [T7] for formula satisfiability, that constructs a decision tree for the formula whose 
paths are short on average, although there might not be a small depth decision tree for the formula. 
Using a Switching Lemma to convert depth d circuits to depth d — 1 circuits is standard. 
However, to achieve the claimed savings with Switching Lemmas is not at all straight-forward. 
Assume that we have converted the bottom two levels of our circuit to fc-CNFs. The main idea is 
that after a random restriction setting all but about n/k variables, with high probability, each of 
these sub-circuits is equivalent to a small depth decision tree. View the random restriction as first 
picking the set of variables to restrict, then the values. For a randomly chosen set of variables to 
restrict, our algorithm will perform an exhaustive search over all settings of the values. If for a 



certain setting, the sub-formulas all become decision trees of depth k' , we can write them all as 
fc'-DNFs and combine with the level of OR gates above. Then we can hope to get the recursive 
savings on these branches for depth d — 1 circuits on n' = Q(n/k) variables. 

So there are two factors that limit our savings with this approach: Even if the formula became 
constant after every restriction, we cannot get savings more than 1/k, since we use exhaustive 
search on n — n/k variables.. On the other hand, there is a failure probability (exponentially small 
in k' ) where our sub-formulas might not become small depth decision trees. For these branches , 
we might not get any savings. So our savings is also bounded by roughly k' /n, the log of the failure 
probability as a fraction of n. Since our value of k' in this depth becomes the new value of k, it is 
hard to see how to get savings more than 1/y/n by this type of argument, even for depth 3. 

We get around this by taking a more error-tolerant approach. We don't assume that all of our 
sub-formulas become small depth decision trees, just most of them. This is still problematic, because 
the sub-formulas might be identical or close, so the events that they become small depth decision 
trees might be highly correlated. To handle this, we look not at the decision tree complexity of a 
single sub-formula but at sets of sub- formulas, where the decision tree has to compute the value 
of each one. If several formulas become high depth for essentially the same reason, once we've 
evaluated one of them, the others should become small depth, so the combined decision tree will 
still have relatively small depth. We give an extended switching lemma that proves that it is 
(almost) exponentially unlikely that there is a large set of sub-formulas who all contribute many 
variables to their joint decision tree. Thus, intuitively, with extremely high probability, either only 
a few sub-formulas remain complex, or the ones that do all involve the same moderate sized subset 
of variables. 

Our algorithm then does a case analysis over which of the sub-formulas are in this set of mutually 
complex ones, and over all paths in their joint decision tree. For each, the other sub-formulas are 
equivalent to fc-DNFS by definition. Because we are not insisting that all sub-formulas become 
small, we can pick k relatively small and still have an extremely small chance of failure, k affects 
the overhead for the case analysis (we'll have to branch on which set of at most n/k of the m 
sub-formulas stay complex, which is relatively small for k = O(logc) ), which is determined by the 
length of the joint decision tree. We can let this be a constant fraction of the remaining variables, 
and so get a failure probability exponentially small in n/k rather than k. 

Here, we describe the overall structure of the algorithm. In Section lBTTI we describe the algorithm 
for converting one level of CNFs to dnfs and vice versa, and in Section 13.21 we prove our extended 
switching lemma. 

We construct a partition for an (n, m, d)-circuit into regions and associated (n', m, d, fe)-circuits 
by using a technique of Schuler [19]. For the sake of exposition, we assume that the bottom level 
gates are V gates. The other case can be handled in an analogous fashion. While there exist a 
bottom level gate <f> of fan-in greater than k, branch on the disjunction of the first k inputs of <p. 
If the disjunction is true, we replace the gate by the smaller k input disjunction. If the disjunction 
is false, we set the value of the k variables in the disjunction. We repeat this branching process 
until all bottom level gates have fan-in at most k. This step reduces the overall savings by about 
2~ k m/n. 

Lemma 3.1 (Bottom Fan-in Reduction Algorithm). Let C be a (n,m,d) -circuit and let k > 1 
be a parameter. There exists an algorithm which outputs a partitioning V = Uo</<„,/A;P/ for C , 
where the sets Vf are disjoint and for each f , Vf contains at most ( ^ ) regions with associated 
(n — fk,m,d,k)-circuits. The algorithm runs in time poly(n) ■ \C\ ■ \V\. 

Proof. Let C be a (n, m, d)-circuit and let k > 1 be a parameter. Let R be an empty (true) fc-CNF 



and let p be a restriction where all variables are unset. Assume that the bottom level gates are V 
gates. 

While there exists a bottom level gate (ft with fan-in greater than k, let (ft' denote the disjunction 
of the first k inputs of (ft. Branch on (ft' . In the branch where (ft' is true, replace (ft with (ft' in C and 
replace R with R A (f)'. In the branch where (ft' is false, replace p with p A -k//, viewing -i0' as a 
restriction which sets the literals in (ft' to false and replace C with C|-,<a'. 

When all bottom level gates in C have fan- in at most k, output (1Z = (R, p),C). 

Along any path in the computation tree of this algorithm, let / denote the number of branches 
where (ft' is false. Note that each path has at most m branches where (ft' is true since each such 
branch reduces the number of bottom level gates with fan-in greater than k. For each value of /, 
there are at most ( m ~f ) paths. At the end of each such path, fk variables are set by the false 
branches, resulting in (n — fk, m, d, /c)-circuits. □ 

Next, we repeatedly reduce the depth of (n, m, i, fc)-circuits by one until it reaches depth 2 
(either a fc-CNF or a /c-dnf). Let <J> be the sequence of subcircuits of an (n, m, i, /c)-circuit at depth 
i — 1. Assume without loss of generality that the subcircuits are /c-dnfs. The main technical 
ingredient for depth reduction is an algorithm which constructs a partition which allows us to 
transform a sequence of fc-DNFs into a sequence of equivalent A;-cnfs in each region, or vice versa. 
This algorithm will be described in detail in Section 13. 1L We apply this algorithm to transform <J> 
into sequences of fc-CNFS. Since the gates at level i — 1 change from V to A, they may be combined 
with the gates at level i — 2 to reduce the depth by one without increasing the number of gates at 
any levels % — 2 or higher. 

Lemma 3.2 (Depth Reduction Algorithm). Let C be a (n,m,d,k) -circuit and let < q < 1/2 

be a parameter. There exists a randomized algorithm which outputs partitioning V for C where 
the circuit associated with each region of "P is a (j$)k, r n,d — l,k)- circuit. With probability at least 
1 — Q; 1^1 < s an d the algorithm runs in time poly(n) • lg - • \C\ ■ s where s < j^ • 2 n ~ Toofc" 1 " 3 m . 

The depth reduction algorithm follows from Lemma [3.31 given below and proved in Section [3.11 

Lemma 3.3 (Switching Algorithm). Let $ = (<fti, . . . ,<ft m ) be a sequence of k-CNFs in n variables 
and let < q < 1/2 be a parameter. There exists a randomized algorithm which takes $ as input 
and outputs a partitioning V for $ where the circuits associated with each region of V are /c-DNFs 
in at most j^ variables. With probability at least 1 — q, \P\ < s and the algorithm runs in time 
poly(n) ■ lg | • |*| • s where s < j^ • 2 n ~Tsk +3 ~ m . 

The case where each (ftj is a A;-CNF and the circuits in each region of V are &-DNFS is symmetric. 

Proof of Lemma \3. E Let C be a (n, m, d, fc)-circuit and let < q < 1/2 be a parameter. Let 
$ = (0i, . . . ,(ftm) be the sequence of subcircuits at rooted at level d — 1 in C (/c-CNFs or A;-dnfs). 
Run the Switching Algorithm (Lemma 13. 3p on * to get {(IZi, *j = (ifti,i, • • • , ifti,m))}i (&i will be a 
sequence of fe-DNFs or fe-CNFs). For each i, let Cj be the circuit resulting from replacing <fti, . . . , (ft m 
with iftn, . . . , tpi m in C and then combining the gates at level d — 2 and d — 1 (which will be the 
same type of gates). Output (1Zi,Ci). 

By Lemma f3.3| each ^j is a depth 2 circuit in at most jt^ variables. Therefore, each C{ will be a 
(iSSfc ' m ,d—l, fc)-circuit after combining the gates at levels d— 2 and d—1. With probability at least 
1 — q, the algorithm of Lemma [3?3] produces V satisfying \P\ < s and runs in time poly(n)-lg --|*| -s 

where s < j^ ■ 2 n ~ wok + m . This algorithm produces a partition of the same size and increases 
the running time by an additive 0(|C| • IT 7 !). □ 



This lemma does d — 2 steps of depth reduction. 

Lemma 3.4 (Repeated Depth Reduction Algorithm). Let C be a (n,m,d,k)- circuit and let < 
q < 1/2 be a parameter. There exists a randomized algorithm which outputs a partitioning V for C 
where the circuit associated with each region is a ( n oofc1 d - 2 > m > ^, & ) -circuit (either a k-CNF or a 
k-DNF). With probability at least 1 — q, \V\ < s and the algorithm runs in time polyin) ■ lg ^ • \C\ ■ s 
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-k 



where s < (2n) d ~ 2 , 2 "-7^^2+(^-2)3 

wnere s ^ (i ofc)( d - 1 )( d - 2 )/ 2 

Proof. We will prove Lemma 13.41 by induction on d. If d = 2, output ((R = l,p= 1), C) since C is 
already a £;-cnf or /c-dnf and the bounds on V and the running time holds with probability 1. 

If d > 2 we assume by induction that we can run this algorithm recursively on circuits of 
depth d — 1 and that the recursive call will satisfy the properties of the lemma. First, run the 
Depth Reduction Algorithm (Lemma I3.2() on (C, q = q/2) to get V = {TZ%, Ci}i (where each C{ is a 
(uJ5fc ' m > ^ — 1) fe)-circuit). Then for each i, since Cj has depth d— 1, run this algorithm recursively 
on the (Cj,<7 = q/2 n+1 ) to get 7^ = {(7^i,j,Cjj)}j (where each Cjj is a ( noofcW- 2 ' m ' ^' ^)" c i rcm t) ■ 
For each j, output (72-j A TZij, Qj)- 

Say that V is good if |P| < T ^2 n ( 1 "Tok)+ 3_fcm . By Lemma E21 P is good with probability at 

least 1 - g/2. For each i, say that Vi is good if |^| < {100 l 1 ) '$% (d ^ )/2 2 {look){ (wok)*-*™ ' 

(note that this is the result of a recursive call on a (j^v,Tn, d—1, fc)-circuit). By induction, each Vi is 

good independently with probability at least 1— q/2 n+1 . By a union bound, Pr [Vi is good for all i | V is good] > 

1 — q/2, since if V is good then the number of Vis is much less than 2™. The probability that V 

and Vi for all i are all good is at least (1 — q/2)(l — q/2) > 1 — q. In this case, the total number of 

outputs is at most 



. 2 n ( 1 -Tm)+3- km ) ( (Jfe) 2 (T5k)( 1 - nn,,Ma-3 ) + (^-3)3- fc m N 



/ 2n 

VlOOfc " y I (100/fc)( d - 2 )( d - 3 )/ 2 

= (2n) rf - 2 ra( i- 7 _^ )+ ( d - 2 )3-* m 

(lOOfc)^- 1 )^" 2 )/ 2 

D 

When we end up with a /c-CNF or a fc-DNF C in a region defined by a /c-CNF i? (and a restriction), 
we apply a random restriction on a 1 — I/O (k) fraction of variables and then construct a decision 
tree for the pair (C, R) and then argue that the total number of leaves in the decision tree isn't too 
big. Each leaf in the decision tree where R = 1 corresponds to a restriction in the region R where 
the value of the circuit is constant. 

Lemma 3.5 (Depth Two Algorithm). Let C be a k-CNF or /c-dnf and R be a k-CNF each in the 
same n variables and let < q < 1/2 be a parameter. There exists a randomized algorithm which 
outputs a partitioning V for C in the region R where each region on V is defined by a restriction 
and the circuit associated with each region of V is either or 1. With probability at least 1 — q, 
\V\ < s and the algorithm runs in time poly(n) ■ lg - • (|C| + \R\) • s where s < 50 • 2 n ^ ~30k) . 

Proof. Let p = -L-. Choose a set U of pn variables uniformly at random. For each restriction p 
which leaves the variables in U unset, construct the decision tree T for (C,R)\ p . For each path p' 
in T ending at a leaf labeled (b, 1), output (p A p' , b). 

The regions defined by each p/\p' output partition the region R, and in each region C has value b. 
All that remains is bounding the number pairs output. By Corollary ll.5l Pr p [height(tree((C, R)\ P )) > s] < 



3(13/30) s . We bound the expected total number of outputs by first summing over all p and then 
summing over s and using the fact that a decision tree of hight at most s can have at most 2 s 

leaves. 



B[\V\] <E 
u u 



y number of leaves of tree((C, R)\ p ) 
p 
2"( 1 -35fc) E [number of leaves of tree((C,R)\ p )} 



u, P 

n 
30k OO 

< 2 n ( 1 ~w) ^ 2 S 3(13/30) S < 3 • 2"'( 1 -w) ^(26/30) s < 25 • 2 n ( 1 ~w). 

s=0 s=0 

By Markov's inequality, with probability at least 1/2, \V\ < 50 • 2 n \ ~sok). We may repeat the 
algorithm lg - times in parallel with independent choices of U and output the smallest partition. 
This increases the probability of success to 1 — q. □ 

We compose the preceding algorithms to get our algorithm for AC circuits, formalized in the 
following lemma. 

Lemma 3.6. Let C be a (n, m, d)-circuit. Let k > 1 be a parameter. There exists a randomized algo- 
rithm which outputs a partitioning V for C where each region is defined by a restriction and the cir- 
cuit associated with each region is either or 1. With probability at least 1 — 2~ n , \V\ < s and the al- 

(2n) d - 2 n ri 3 "j_i +(rf-2)3~ fc m+4-2- fc max(m,n/fc) 

gorithm runs m time at most poly (n)-\C\-s where s < 50 , J. (d l 1)(d _ 2); / 2 2 (100*0 



Theorem 11.21 follows straightforwardly from Lemma l3T 

1 
Proof of Theorem \1.2[ Let C be a (n, m, <i)-circuit. If m > 2 '"•* then output all 2 n restrictions 

which set all of the variables. Otherwise, we may choose k = (max (lg —,dlgdj) such that 

c;n \ > ^{4-2)3- k m+A-2~ k max(m,n/fc) < orioofcW" 1 

(100/fc)( rf -D( rf - 2 )/ 2 " 

n _— 1 3 _ + o(l) 

and then use Lemma 13.61 In either case, the algorithm outputs at most 2 o(i g ^+di g d) 

restrictions. □ 

For the sake of simplifying the calculations in the proof of Lemma l3.6| we first prove the special 
case where the circuit has bottom fan-in k. 

Lemma 3.7. Let C be a (n,m,d,k)-circuit. There exists a randomized algorithm which outputs a 
partitioning V for C where each region is defined by a restriction and the circuit associated with 
each region is either or 1. With probability at least 1 — 2~ 2n , \V\ < s and the algorithm runs in 

time at most poly{n) ■ \C\ ■ s where s < 50 {WQk y { ™l lKd _ 2)/2 2 n ( 100fc ) d_1 ' "\ 

Proof. Run the Repeated Depth Reduction Algorithm (Lemma 13. 4p on (C,q/2) to get a partition 
V = {((Ri, Pi),Ci)}i for C. Each d is either a A;-cnf or a A;-dnf in ( ", d _ 2 variables. Run the 

Depth Two Algorithm (Lemma 13. 5p on (Ci,Ri,q = q/2 n+l ) to get a partition V% = {{Pi,j,°i,j)}j 
for d. Output (pi Apij,b iyj ). 



Say that Pis good if \V\ < (100 ff ^- 2 )/2 ^ W^ )+( " ^ ^ • By Lemmata Pr [V is good] > 

r(i-^) 



1 - q/2. For each i, say that Vi is good \i\Vi\ < 50 • 2 ( 10M )^ 2 30fc (note that the Depth Two Algo- 
rithm is run on circuits in , ", d _ 2 variables). By Lemma [331 Pr [Pi is good] > 1 — q/2 n+1 . By a 
union bound, Pr [Vi is good for all i \ V is good] > l—q/2, since if P is good then the number of V%s 
is much less than 2 n . The probability that V and V% are all good is at least (1 — g/2)(l — q/2) > 1 — g. 
In this case, the total number of outputs is at most 

/ (2n) rf ~ 2 rt (i-_J^) +(d - 2 )3-* m \ / _ J -^ ( i-^- k) 



V(100fc)( rf -D(^ 2 )/ 2 ' ' ~ 



(2n) d 2 n(l 2- r ^ r )+((i-2)3- fc m 

< 50 - - 9 (ioofc) d - 1 ' v ; 

" (100k)(d-l)(d-2)/2 Z 

D 

Proof of Lemma 13,61 Run the Bottom Fan-in Reduction Algorithm (Lemma 13. ip on (C, fe) to get 
V = Uo</< n /fc"P/ where P/ = {((Rf,i, Pf,i),Cf t i)}i. For each / and i, the circuit C/^ is an (n — 
/fc, m, d, fe)-circuit. Run the algorithm of Lemma 13.71 on (Cfi,q = 2 _2n ) to get a partition Vt% = 
i(Pf,ij)' b f,ij)}j for C/,i- Output (pf }i A p/,jj, 6/,jj). 

By Lemma [3. 1| the sets Py are disjoint and for each /, \Vf\ < ("T ) and the circuits associated 
with each region in Vt are (n — fk, m, d, fe)-circuits. For each / and i, say that Vij is good if \Vf t i\ < 

^ m ^i^ 2 ^ mi ~ T ^^ )+ ^ 2 ^~ km - By LemmaES Pr [V U is good] > 1 - 2" 2 ™, and 
by a union bound over the at most 2 n pairs /, i, all of the sets Vfi are simultaneously good with 
probability at least 1 — 2~ n . In this case, the total number of outputs is at most 

p \ f / (loofc)^- 1 )^- 2 )/ 2 



(2n) rf 2 n( i- 7 _^ :T ) +(d -2)3-*m ^(m + f\ -mi-— |^)+(d-2)3-*m 

(lOOfc)^" 1 )^- 2 )/ 2 t^\f 



(2fl) n(l 2.^— r )+(d-2)3- fe m+4-2- fe max(m,n/fc) 

- (loofe)^- 1 )^- 2 )/ 2 



since 



w™+/yw-aa^)< ^ ( m+n /^ 2 -/ fc = (i+2- fc ) m+n/ 

/=o \ J J /=o V / / 

^ o(lge)2 _fe (m+n/fc) ^ n4-2 _fe max(m,n/fc) 



□ 



3.1 Switching Algorithm 



Lemma (Lemma 13.31 restated). Let $ = (</>i, . . . , </> m ) 6e a sequence of /c-cnfs in n variables and 
let < q < 1/2 6e a parameter. There exists a randomized algorithm which takes $ as input 
and outputs a partitioning V for <I> where the circuits in each region of V are /c-dnfs in at most 
yj?0£ variables. With probability at least 1 — q, \V\ < s and the algorithm runs in time at most 

poly(n) • lg | • |3>| • s where s < i^fc2 n_ i5k+ 3_ m . 
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Let (j) be a &-CNF. Consider the decision tree tree((p) f° r <\>- If the height of tree{<p) is at 
most k', then we can construct a A/-DNF <p' equivalent to <p by taking the disjunction of the terms 
corresponding to paths in tree((p) labeled 1. 

However, in general, k! will be much larger than k. In this case, we split the paths in tree((j>) 
into two categories: "short paths" of length at most k, and "long paths" of length greater than 
k. Since any assignment to a set variables is consistent with exactly one path in any decision tree 
on those variables, we can partition {0, l} n by partitioning the paths. We will construct a £;-CNF 
T((j>, k) which will define the region corresponding to the set of short paths. Rather than defining 
a single region for the set of long paths, we further partition the space and define a separate region 
for each long path (each long path viewed as a restriction defines a region). 

Formally, for any /c-cnf (p and any k > 1, let £' = {a[, . ,. ,a'g,} be the set of paths of length 
greater than k in tree(<p). Let £ = {o"i, . . . , ag} be the set of paths of length k in tree((p) that do 
not end at a leaf (equivalently, £ consists of the paths in £' truncated after k variables). Define 
T(4>, k) = -i(7i A -i(T2 A • • • A -\(Tg where each ctj is viewed as the conjunction of the literals along the 
path. Note that T((f>, k) is a £;-cnf. An assignment is in the region corresponding to short paths 
if and only if it is not consistent with any a^. Since the paths a' i form complete decision trees after 
the first k variables along each path, an assignment is not consistent with any a'j if and only if it is 
not consistent with any (7j, and therefore if and only if it satisfies T(<j>, k). 

The algorithm will branch on the region T((j), k) and on the regions corresponding to long 
paths. In the region T(<p,k), we can do as we did before and convert (p into an equivalent /c-DNF 
by only considering the short paths ending with 1. Formally, for any A:-CNF <p and any k > 1, let 
S(cj>, k) = t\ V T2 V • • • V T£ where t±, . . . ,T£ are the paths of the decision tree for <j> of length at most 
k with label 1 where each n is viewed as a conjunction of the literals along the path. S(4>, k) is a 
&-DNF and is equivalent to (j) in the region T(4>, k). In the regions correspond to long paths, (p is 
constant and therefore trivially a /c-DNF. 

We repeat this process for each fc-CNF (p\, . . . , <p m in order, each time recursively partitioning 
the current set of regions. 

The problem with this approach is that we may well branch more than 2™ times between 
branching on T(4>, k) and branching on each long path. We solve this problem by first applying a 
random restriction which leaves a small constant fraction of the variables unset, and then do depth 
reduction as described above. In this case, we can bound the probability that the total number of 
variables set along long paths will be too large using Lemma 11.41 



Proof. Let p = j^kj:- Choose a set U of pn variable to leave unset uniformly at random. Branch on 
each restriction po which leaves the variables in U unset. Let p = po an d let R denote an empty 
(true) fc-CNF. For each <pi in order, branch on T(cpi\p, k). 

In the region p A T(4>t\ p ,k), (pi is equivalent to S(4>i\ p ,k) so set ipi = S((pi\ p ,k). Let R = 
R A T(4>i\ p , k). In this branch, say that (pi is "not targeted." 

When T((pi\ p , k) is false, we further branch on each path p' in tree{(pi\ p ) of length greater than 
k. Let p = p A p' and let ipi = b' where b' is the label at the end of the path p' . In each of these 
branches, say that (pi is "targeted." 

Once we have branched in this fashion for each (pi, output the resulting (1Z = (R,p),^ = 
(ip-L,...,ip m )). 

This algorithm naturally defines a computation tree: First branch on each restriction po> then 
for each (pi in order branch on whether 0, is targeted, and if (pi is targeted branch on each long 
path. Each leaf of this computation tree correspond to a region in partition output. Define the 
"type" of each leaf as the sequence of targeted /c-cnfs along the path to the leaf. We will group 
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the leaves of the computation tree both by type and by parent restriction po i n order to bound the 
expected number of leaves. 

For any po and any type (sequence of targeted &;-CNFs) T, let G POt T denote the set of leaves of 
type T with parent restriction p$. Consider the decision tree tree(T\ po ). Let P P0t T denote the set of 
paths in tree(T\ PQ ) where each fc-CNF in T contributes at least k + 1 variables to the path. The set 
of paths P P0t T correspond exactly to the leaves in G POt T- The path in P P0t T corresponds to which 
branch p' is taken at each targeted A;- CNF (each p' sets at least k + 1 variables since only paths of 
length > k are considered when branching). 

By Lemma 11.41 Ptjj iPq [tree(T\ po ) has a path of length > s where each fc-CNF in T contributes 



at least one variable] < (13/100) s , 
tree of height s can have at most 2 s 



which gives B UtPQ [|P P0 ,t|] < 



£s=o 2S ( 13 / 100 ) £ 



(any decision 



leaves). Since we only consider paths in tree(T\ po ) where each 



fa G T contributes at least k + 1 variables, any such path must have length at least \T\(k + 1). We 
bound the expected number of outputs by summing over restrictions po, then path lengths s and 
then sets of targeted £;-cnfs T of size at most s/(k + 1). 

e [\n = E E f T i\ p poM) = i n ~ pn E „ E ti p po,Ti] 



u 



Po TC{0i 
<- cyn—pn 



-} 



pn 



Y V2 s Pr 

L ^ 1 /L ^ u, Po 
rc{^,.,^}«=o 



pn 



ryn—pn \ "* \ "* 



--0TC {<fiu-,<M 

\T\<s/(k+l) 

pn [s/k] 



2 s Pr 

U,po 



U,po 

TC.{<t>x,...,<t>m} 

tree(T\ po ) has a path of length > s 
where each fa £ T contributes > k vari- 
ables 

tree(T\ po ) has a path of length > s 
where each fa £ T contributes > k vari- 
ables 



pn ls/Kj . . pn , v 



grouping terms with the same value of [s/k\ and replacing [s/k\ with s' 



< 



2 n- P n Y^ks'( m \ (26/100) fcs ' < pn2 n - pn E ( ™ ) (26/100) fcs ' 



n 



-2 n -P n (l + (26/W0) k ) m < 
100k v v ' > > - loofc 



s'=0 



2n-pn+(lg e) (26/100) fc m 



2d 



With probability at least 1/2, |7>| < ^ • 2 r ' 



Toofe+ 3 m . We may repeat the algorithm lg - times 
in parallel with independent choices of U and output the smallest partition. This increases the 
probability of success to 1 — q. □ 



3.2 Extended Switching Lemma 

Lemma (Lemma 11.41 Extended Switching Lemma, restated). Let fa, . . . , <p m be a sequence of 
k-CNFs and/or fc-DNFs in the same n variables. For any p < 1/13, let p be a random restriction 
which leaves pn variables unset. The probability that the decision tree for (fa, . . . , (j> m )\ p has a path 
of length > s where each (pi contributes at least one node to the path is at most (13pk) s . 

Our switching lemma is based on Beanie's proof pQ of Razborov's Switching Lemma [16] which 
in turn is based on Hastad's Switching Lemma [5j. The idea is to encode a restriction and a 
corresponding "bad" path in the resulting decision tree using a different restriction which sets 
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additional variables and a few extra bits. The total size needed for this encoding will be much 
less than the size needed to encode the original restriction and this ratio will give the probability 
bound. 

Proof. Let P' be a path in tree((cpi, . . . , 4>m)\p) of length at least t. Let P denote the prefix of P' 
of length exactly t. Let m' denote the index of the last formula (ft m > which contributes a variable to 
P. Let xi, ■ ■ ■ ,xt denote the variables along P and let p±, ... ,pt denote the values that P assigns 
to xi, ■ ■ ■ , xt- Let C\, . . . , Ct and F±, . . . ,F t denote the clauses and formulae respectively which 
contribute xi,...,xt (some CjS and -FjS may refer to the same clauses or formulae if they contribute 
more than one variable to P). Let index±, . . . , indext denote the indices of x\, . . . , x% in the clauses 
Ci, . . . , Ct- Let lasti, 1 < i < t be 2 if X{ is the last variable contributed by Fi along P; be 1 if Xj 
is the last variable contributed by Ci (but not by Fi) along P; and be otherwise. 

Let a = o"i ■ ■ • cr s -t-i where <jj is a restriction where Ui(xi) = if X{ appears positively in Ci 
and o~i(xi) = 1 otherwise, and o~i(y) = * for all y ^ Xi [a is constructed to not satisfy the clauses 
C\, . . . ,C s+ \). Let 7Tj be the restriction where 7Tj(xj) = pi and TTi(y) = * for y / Xj. Note that 

P = 7Tl ■ ■ ■ 7T s+ i 

We map (p, P) to p' = pa £ lZ P n , index = (indexi, . . . , index t ) G [A;]*, ?ast = (lasti, . . . , Zasit) G 
[3]*andp=(p 1 ,...,p t )G [2]*. 

We must now show that we can decode (p, P) from pi , index, last and p. Let /?j = pm • • • TTiCTi+i • • • cr s +i , 
for < i < s + 1. Note that po = /°' an d p s +i = pP ■ 

We will show that for any i < s + 1 given Ci, Fi and /Oj, we can decode 7rj + i and therefore pi+i- 
Then by induction, given p' we can decode /) and P. Let Zasio = 1 and let Fq = (pi. First we 
identify Fi + \. If iasij = 2 then Fj + i = 4> q +\ where q is the index such that Fi = <\> q , and otherwise 
Fi + \ = Fi. If lasti = then we know Cj+i = Cj, otherwise we claim that Cj+i is the first clause 
not satisfied by pi. Once we identify Cj+i, then indexi + \ is the index of Xj+i in this clause and we 
get 7Tj using p^. All that remains is to prove the claim that when lasti = 1 then Cj+i is first clause 
in F not satisfied by pi. 

When Xj+i is queried along the path P when constructing the decision tree for F\ p , C\+i is 
the first clause not satisfied by pit\ ■ ■ ■ 7Tj (otherwise a variable in an earlier clause would have been 
queried instead) . Since setting more variables in a restriction cannot change a clause from satisfied 
to not satisfied, all the clauses in F before Ci are satisfied by pi. All that remains is to show that 
(Tj+i • • • o~ s+ i does not satisfy Cj+i. Let j be the largest index such that Cj = Cj+i, or equivalently 
Xj is the last variable from Cj+i along P. By the construction of Cj+i, . . . ,o~j, o~i + i ■ ■ ■ o~j does not 
satisfy Cj+i and either j = t ox pm ■ ■ ■ vtjO"j + i ■ • • a j sets all of the variables in Cj+i (because of the 
way we construct decision trees). In either case, no an,i > j can satisfy Cj+i so we conclude that 
Pi does not satisfy Ci + \. 

All that remains is to calculate the probability over the choice of p that the path P exists. We 
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bound this probability by the size of the encoding when P exists over the size of the encoding of p. 

g*T* x [fcf x [3]* x [2]'| _ U w -t)2"-^*(6fe) f 



Pr 

p 



tree((0i,...,0 m )| p ) has a 
path of length > s where 
each (pi contributes at least 
.one node to the path 



< 



•n—pn 



\n p n n \ ( n )2 

(pn)\(n-pn)l t 

(pn — t)l(n — pn + t)\ 

pn \ ( pn — 1 \ / pn — t + 1 



< 



n — pn + t J \n — pn + t — 1 J \n — pn + 1 



(12fc) ; 
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4 Correlation & Lower Bounds 

Let C be a depth d size m AC circuit. Theorem 11.21 implies that there exists a set of at most 
2n-n/o(igm+digd) restrictions which partition {0, 1}™ and make C constant. 

Consider the parity functions. Any partition of {0, 1}™ which makes parity constant requires 
2 n restrictions. Setting 2 n-n /°( 1 & m +« 1 g") > 2™ and solving for m in terms of d and vice versa we 
get the following bounds, which match the optimal bounds proved by Hastad [5]: 



Isn r\ I Isn 



Any size poly(n) AC circuit which computes parity requires depth at least ,?" — O 



lglgn \lg 2 lgn/ ' 



Any depth d circuit which computes parity requires size at least 2 



Definition 4.1. Let C denote a class of circuits. Let / : {0, l} n — > {0, 1} be a function. The 
correlation of / with circuits from C is 

maxf Pr [C(x) = f(x)]- Pr [C(x) / fix)] | = max E f(-l) c ^(-l) / ^ 

c&c Vze{o,i}" L v ' y n xe{o,i}" V c-ec xe{o,i} n L v ; v ; 

Proof of Theorem \1.0ft Let C be the class of depth d size en AC circuits. We will bound the 
correlation of C and the parity function. Let C be an element of C and consider the partition 
produce by Theorem 11.21 Each restrictions in this partition which sets fewer than n variables 
contributes to the correlation with parity. Each restriction which sets all n variables contributes 
at most 2 _n to the correlation with parity. Thus the correlation of C with parity is at most 
2-Mc,d« = 2~ n /°( 1 s c+dl s d ) d \ □ 



We give a construction of a family of (n, en, <i)-circuits that both require 2™ n / fJ ( 1 s c ) regions 
and have correlation 2~ n '^' lgc ) with parity. Thus, our algorithm and the implied correlation 
bounds with parity are close to optimal. This construction is generally considered folklore, and a 

more detailed explanation is given in [8]. 

i 

Parity on £ inputs can be computed by depth d size 0(£2 ed ^ 1 ) circuits with either an A or an 
V output gate. Construct a circuit on n inputs by grouping the inputs into ", d -i groups of 

W^lg C) 

£ = & (lg c) ~ inputs. Construct depth d circuits computing the parity of each group each with an 
A output gate. Take the conjunction of all of these circuits. This gives a (n, en, d)-circuit. 
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Each input which sets the circuit to 1 correctly computes parity (assume that the number 
of groups of inputs is odd). At most half the remaining inputs compute parity correctly. The 
fraction of inputs which cause the circuit to output one, and therefore the correlation with parity 

n n n 

is 2 ©(igc)'*- 1 . The number of inputs which cause the circuit to output one is 2 ©(igc) d - 1 _ Since 
flipping any bit on any of these inputs will change the output to 0, each input which causes the 



circuit to output 1 must have its own region, giving a 2 n ( 1 § c ) d ~ 1 lower bound on the number of 

regions. 

Finally, we will sketch how a small but significant improvement in our algorithm combined with 

the results of Williams [2Q|, [2Tj would show new circuit lower bounds. If the satisfiability of size m, 

n n (l — my) 1 

depth d AC circuits in n variables can be decided in time 2 o(igm)°w ^ ^ en NEXP 2 NC . 

Note that any such algorithm cannot do as we did and enumerate satisfying restrictions since there 

can be too many. 

Any NC circuit C with n inputs can be converted to an equivalent depth d size 2 n 

AC circuit, by the following construction which seems to be considered folklore. Let D = O (lgn) 

denote the depth of C . Divide the layers of C into d — 1 groups of -j—r consecutive layers. Replace 

p 
each group of layers by trivial depth two, size poly(n)2 2 ~ circuits, alternating between ANDs of 

ORs and ORs of ANDs, and then combine layers of the same type of gates. Since D = O (lgn), 

°(*=t) n 
the result is a depth d size 2 n AC circuit. 

Williams shows that for any "reasonable" class of circuits C (AC , ACC,NC , P/poly, etc.), if 

the satisfiability of circuits in C with n inputs is in co-nondeterministic time 2 n_n for some constant 

, ra(l — T^y) 

e > then NEXP % C. Running a (hypothetical) 2 o(i g m)°w satisfiability algorithm on the 

(33l) n 1 

size 2 n AC circuit resulting from a NC circuit would give a sufficiently fast satisfiability 

algorithm for Williams' result to show NEXP % NC 1 . 
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